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Abstract. We give a new deterministic algorithm that non-adaptively 
learns a hidden hypergraph from edge-detecting queries. All previous 
non-adaptive algorithms either run in exponential time or have non- 
optimal query complexity. We give the first polynomial time non-adaptive 
learning algorithm for learning hypergraph that asks almost optimal 
number of queries. 


1 Introduction 

Let Q Str be a set of all labeled liypergraphs of rank at most r on the set V = 
{1,2,..., 77 .} with at most s edges. Given a hidden hypergraph G G Gs,r, we 
need to identify it by asking edge-detecting queries. An edge-detecting query 
Qg{S), for S C V is: does S contain at least one edge of G? Our objective is to 
non-adaptively learn the hypergraph G by asking as few queries as possible. 

This problem has many applications in chemical reactions, molecular biology 
and genome sequencing. In chemical reactions, we are given a set of chemicals, 
some of which react and some which do not. When multiple chemicals are com¬ 
bined in one test tube, a reaction is detectable if and only if at least one set of 
the chemicals in the tube reacts. The goal is to identify which sets react using 
as few experiments as possible. The time needed to compute which experiments 
to do is a secondary consideration, though it is polynomial for the algorithms 
we present. See I^IAlTi SKimirilSIfil 17* 11)11 m for more details 
and many other applications in molecular biology. 

In all of the above applications the rank of the hypergraph is much smaller 
than the number of edges and both are much smaller than the number of ver¬ 
tices n. Therefore, throughout the paper, we will assume that r < s and s = o{n). 

The above hypergraph learning problem is equivalent to the problem of non- 
adaptively learning a monotone DNF with at most s monomials (monotone 
term), where each monomial contains at most r variables (s-term r-MDNF) 
from membership queries m ■ In this paper we will use the later terminology 
rather than the hypergraph one. 

The adaptive learnability of s-term r-MDNF was studied in msm- In 
UJ, Abasi et. al. gave a polynomial time adaptive learning algorithm for s-term 
r-MDNF with almost optimal query complexity. The non-adaptive learnability 
of s-term r-MDNF was studied in |33I27I28I24I20I19I12) . 
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Torney , (33J, first introduced the problem and gave some applications in 
molecular biology. The first explicit non-adaptive learning algorithm for s-term r- 
MDNF was given by Gao et. ah, [24]. They show that this class can be learned us¬ 
ing (n, (s, r))-cover-free family ((n, (s, r))-CFF). This family is a set A C {0,1}” 
of assignments such that for every distinct ii, ■ ■., i s ,j l,... ,j r G {1,..., n} there 
is a G A such that = ■ • • = a* a = 0 and aj 1 = • • • = a,j r = 1. Given such a set, 
the algorithm simply takes all the monomials M of size at most r that satisfy 
(Va G A){M{a) = 1 =» /(a) = 1). It is easy to see that the disjunction of all such 
monomials is equivalent to the target function. Assuming a set of (n, (s, r))-CFF 
of size N can be constructed in time T , this algorithm learns s-term r-MDNF 
with N queries in time 0((") + T). 

In [2QI9j . it is shown that any set A C {0,1}" that non-adaptively learns 
s-term r-MDNF is an (n, (s — l,r))-CFF. Therefore, the minimum size of an 
(n, (s — 1, r))-CFF is also a lower bound for the number of queries (and therefore 
also for the time) for non-adaptively learning s-term r-MDNF. It is known, [311 , 
that any (n, (s,r))-CFF must have size at least f2(N(s, r) log n) where 



(1) 


Therefore, any non-adaptive algorithm for learning s-term r-DNF must ask at 
least N(s — 1, r) log n = f2(N(s,r) log n) queries and runs in at least I2(iV(s, r)n 
logn) time. 

Gao et. al. constructed an (n, (s, r))-CFF of size S = (2s log n/ log(s log n)) r+1 
in time O(S). It follows from [32] that an (n, (s, r))-CFF of size O ((sr) log " logn) 
can be constructed in polynomial time. A polynomial time almost optimal con¬ 
structions of size N(s, r) 1_l_0< ' 1 - ) logn for (n, (s, r))-CFF were given in [Illl0|12l23j 
which give better query complexities, but still, the above algorithms have expo¬ 
nential time complexity 0((")), when r is not constant. The latter result implies 
that there is a non-adaptive algorithm that asks Q := N(s,r ) 1+ °^ log?z queries 
and runs in exponential time 0((")). Though, when r = 0(1) is constant, the 
above algorithms run in polynomial time and are optimal. Therefore, we will 
assume r = w(l). 

Chin et. al. claim in m that they have a polynomial time algorithm that 
constructs an (n, (s,r))-CFF of optimal size. Their analysis is misleadingly The 
size is indeed optimal but the time complexity of the construction is 0(( r " )). 

But even if a (n, (s,r))-CFF can be constructed in polynomial time, the above 
learning algorithm still takes 0((")) time. 

Macula et. al., [27)28] . gave several randomized non-adaptive algorithms. We 
first use their ideas combined with the constructions of ( n , (r, s))-CFF in [11110112123] 
to give a new non-adaptive algorithm that asks A^(s, r) 1 " 1-0 ^ 1 - 1 log 2 n queries and 
runs in poly(n, N(s,r)) time. This algorithm is almost optimal in s and r but 
quadratic in logn. We then use a new technique that changes any non-adaptive 

3 Some parts of the construction can indeed be performed in polynomial time, but not 
the whole construction 
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learning algorithm that asks Q(r, s, n) queries to a non-adaptive learning al¬ 
gorithm that asks (rs) 2 ■ Q(r, s, (rs) 2 ) log n queries. This give a non-adaptive 
learning algorithm that asks N(s, r) 1 " 1 " 0 ^ logn queries and runs in nlogn • 
poly(N(s,r)) time. 

The following table summarizes the results (r = w(l)) 


References 

Query Complexity 

Time Complexity 

m 

N(s,r) ■ (rlogn/ log(s logn)) r+i 

a 

m 

N(s,r) logn 

ra 

(1110 12 23 

N(s, r) 1+ °l 1) logn 

if 

Ours+ 27 28 + [12] 

N(s, r) 1+ °W log 2 n 

poly(n, N(s, r)) 

Ours 

iV(s,r) 1+ ° (1) logn 

(n log n) ■ poly(N(s , r)) 

Ours, r = o(s) 

N(s,r) 1+ °^ logn 

(n log n) • N(s, r) 1+o(1) 

Lower Bound 20] 

N(s,r) logn 

(nlogn) • N(s,r) 


This paper is organized as follows. Section 2 gives some definitions and pre¬ 
liminary results that will be used throughout the paper. Section 3 gives the first 
algorithm that asks N(s,r) 1+ °^ 1 ' 1 log 2 n membership queries and runs in time 
poly(n,N(s,r)). Section 4 gives the reduction and shows how to use it to give 
the second algorithm that asks N(s,r) 1+ °^ 1 ' 1 log n membership queries and runs 
in time (nlogn) • N(s,r) 1+ °'- 1 ’. All the algorithms in this paper are determinis¬ 
tic. In the full paper we will also consider randomized algorithms that slightly 
improve (in the o(l) of the exponent) the query and time complexity. 

2 Definitions 

2.1 Monotone Boolean Functions 

For a vector w , we denote by Wi the ith entry of w. Let {eW | i = 1,..., n} C 
{0,1}” be the standard basis. That is, = 1 if i = j and e:-' 1 = 0 otherwise. For 
a positive integer j, we denote by [j] the set {1, 2For two assignments 
a,b £ {0, l} n we denote by (a A b) £ {0,1}™ the bitwise AND assignment. That 
is, (a A b)i = at A bi. 

Let /( X\,X 2 , ■ ■ ■ ,x n ) be a boolean function from {0,1}™ to {0,1}. For 1 < 
*i < *2 < • • • < ik < n and <7i,..., a*, £ {0,1} U { x\ ,..., x n } we denote by 

f\xi 1 i—<Tl,Xi 2 i-CT2,-",Xi k i-<Tk 

the function f(yi, ■.. ,y n ) where yq. = aj for all j £ [/c] and yi = Xi for all i £ 
[n]\{ii,... ,ik}- We say that the variable Xi is relevant in / if /| Xi< _o ^ /| Xi <- 1 - 
A variable Xi is irrelevant in / if it is not relevant in /. We say that the class is 
closed under variable projections if for every / £ C and every two variables Xi 
and Xj, i,j < n, we have /£ C. 

For two assignments a,6£{0,l}", we write a < b if for every i £ [n], < bi. 

A Boolean function / : {0,1}" —> {0,1} is monotone if for every two assignments 
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a,b G {0, l} n , if a < b then /(a) < f(b). Recall that every monotone boolean 
function / has a unique representation as a reduced monotone DNF, [TJ. That is, 
/ = Mi V M 2 V • • • V M s where each monomial Mi is an ANDs of input variables, 
and for every monomial Mi there is a unique assignment G {0,1}" such that 
f(a W) = 1 and for every j G [n] where = 1 we have f(a^\ Xj< -o) = 0. We 
call such assignment a minterm of the function /. Notice that every monotone 
DNF can be uniquely determined by its minterms pj. That is, a G {0, l} n is a 
minterm of / iff M := /\i^{j :a j=i}Xi is a monomial in /. 

An s-term r-MDNF is a monotone DNF with at most s monomials, where 
each monomial contains at most r variables. It is easy to see that the class s-term 
r-MDNF is closed under variable projections. 


2.2 Learning from Membership Queries 

Consider a teacher that has a target function / : {0,1}” —> {0,1} that is s-term 
r-MDNF. The teacher can answer membership queries. That is, when receiving 
a G {0,l} n it returns /(a). A learning algorithm is an algorithm that can ask 
the teacher membership queries. The goal of the learning algorithm is to exactly 
learn (exactly find) / with minimum number of membership queries and optimal 
time complexity. 

Let c and H D C be classes of boolean formulas. We say that C is learn- 
able from H in time T(n) with Q(n) membership queries if there is a learning 
algorithm that, for a target function / G C, runs in time T(n), asks at most 
Q{n) membership queries and outputs a function h in H that is equivalent to C. 
When H = C then we say that C is properly learnable in time T(n ) with Q(n) 
membership queries. 

In adaptive algorithms the queries can depend on the answers to the previous 
queries where in non-adaptive algorithms the queries are independent of the 
answers to the previous queries and therefore all the queries can be asked in 
parallel, that is, in one step. 

2.3 Learning a Hypergraph 

Let G s ,r be a set of all labeled hypergraphs on the set of vertices V = { 1 , 2 ,..., 71 } 
with s edges of rank (size) at most r. Given a hidden hypergraph G G Gs,r , we 
need to identify it by asking edge-detecting queries. An edge-detecting query 
Qg(S), for S C V is: does S contain at least one edge of G? Our objective is to 
learn (identify) the hypergraph G by asking as few queries as possible. 

This problem is equivalent to learning s-term r-MDNF / from membership 
queries. Each edge e in the hypergraph corresponds to the monotone term /\i^ e Xi 
in / and the edge-detecting query Qg(S) corresponds to asking membership 
queries of the assignment a where a[ S ^ = 1 if and only if i G S. Therefore, 
the class Gs.r can be regarded as the set of s-term r-MDNF. The class of s-term 
r-MDNF is denoted by Gt r - Now it obvious that any learning algorithm for Gt r 
is also a learning algorithm for G s ,r- 
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The following example shows that we cannot allow two edges e\ C e^- Let G\ 
be a graph where V\ = {1,2} and E\ = {{1},{1,2}}. This graph corresponds 
to the function / = X\ V X\X 2 that is equivalent to x± which corresponds to the 
graph G 2 where V 2 = {1,2} and E 2 = {{1}}- Also, no edge-detecting query can 
distinguish between G\ and G 2 . 

We say that A C {0,1} is an identity testing set for Q* r if for every two 
distinct s-terrn r-MDNF /1 and /2 there is a £ A such that /i(a) / / 2 (a). 
Obviously, every identity testing set for Q* r can be used as queries to non- 
adaptively learns Q* r . 

2.4 Cover Free Families 

An (n, (s,r))~ cover free family ((n, (s,r))-CFF), [22], is a set A C {0,1}" such 
that for every 1 < i\ < 12 < • • • < id, < n where d = s + r and every J C [d] of 
size | J| = s there is a € A such that a,i k = 0 for all k £ J and ai j = 1 for all 

j £ [d]\J. Denote by N(n, (s,r)) the minimum size of such set. Again here we 

assume that r < s and s = o(n). The lower bound in |3ll29j is 

N(n, (s, r)) > f2 (N(s, r) ■ logn) (2) 

where iV(s, r) is as defined in {T}. It is known that a set of random 

m = ° ( r1 ' 5 ( lo S (; + x ) ) ( N ( s > r ) • log « + log ^ ) 

= N(s, r) 1+ °l 1 l(logn + log(l/<5)) (3) 

assignments € {0,1}", where each a^ is 1 with probability r/(s + r ), is an 
(n, (s,r))-CFF with probability at least 1 — 6. 

It follows from I11I10H12I23I that there is a polynomial time (in the size of 
the CFF) deterministic construction of (n, (s,r))-CFF of size 

IV(s,r) 1+o(1) logn (4) 

where the o(l) is with respect to r. When r = o(s) the construction runs in 
linear time mm- 

2.5 Perfect Hash Function 

Let H be a family of functions h : [n] —> [(]]■ For d < q we say that H is an 
(n, q 1 d)-perfect hash family ((n, q, d)-PHF) [7] if for every subset S C [n] of size 
I SI = d there is a hash function h £ H such that h |<? is iniective (one-to-one) 
on S, i.e., \h(S)\ = d. 

In [ill Bshouty shows 

Lemma 1. Let q > 2d 2 . There is a ( n,q 1 d)-PHF of size 

d 2 log n 
log(g/d 2 ) 

that can be constructed in time 0(qd 2 n\ogn/ log(g/ d 2 )). 
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We now give the following folklore results that will be used for randomized 
learning algorithms 

Lemma 2. Let q > d(d— l)/2 be any integer. Fix any set S C [n] of d integers. 
Consider 

N , = log(l/<?) < Jog(V^)_ 

l0 ® dfF-Vj 

uniform random hash functions hi : [n] [g], i = 1,..., N where 


g(q,d) := ^1 - 




d — 1 


With probability at least 1 — 5 one of the hash functions is one-to-one on S. 


3 The First Algorithm 

In this section we give the first algorithm that asks N(s,r) 1+ ° < ' 1 ' 1 log 2 n queries 
and runs in time poly(n, N(s , r)) 

The first algorithm is based on the ideas in |27l28| that were used to give a 
Monte Carlo randomized algorithm. 

Lemma 3. Let A be an (n, (1 ,r))-CFF and B be an ( n , (s — 1 ,r))-CFF. There 
is a non-adaptive proper learning algorithm for s-term r-MDNF that asks all the 
queries in A A B := {a A b | a £ A, b € B} and finds the target function in time 
\A A B\ ■ n. 

Proof. Let / be the target function. For every b G B, let Ab = AAb := {aAb \ a G 
A}. Let If, be the set of all i G [n] such that (a A b)i > f{a A b) for all a G A. Let 
Tb := Aiei b Xi. We will show that 

1. If T is a term in / then there is b G B such that Tb = T. 

2. Either Tb = A ie [ n ]CCj or Tb is a subterm of one of terms of /. 

To prove 1, let T be a term in / and let b G B be an assignment that satisfies 
T and does not satisfy the other terms. Such assignment exists because B is 
(n, (s — l,r))-CFF. Notice that f(x A b) = T(x) = T(x A b). If Xi is in T and 
/(a A b) = 1 then T(a A b) = T(a) = f (a A b) = 1 and (a A b)i = 1. Therefore 
i G h and Xi in Tb. If Xi not in T then since A is (n, (1, r))-CFF there is a' G A 
such that T(a ') = 1 and a- = 0. Then (a'Ab)i = 0 where f(a'Ab) = 1. Therefore 
i is not in and Xi is not in Tb■ Thus, Tb = T. 

We now prove 2. We have shown in 1 that if b satisfies one term T then 
Tb = T. If b does not satisfy any one of the terms in / then f(a A b) = 0 for all 
a £ A and then Tb = A ig r n i Xi. Now suppose b satisfies at least two terms Tf and 
T 2 . Consider any variable Xi. If x % not in Tf then as before Xi will not be in Tb- 
This shows that Tb is a subterm of Tf. □ 

This gives the following algorithm 
We now have 
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Learn (Qf r ) 

1) Construct an ( n , (l,r))-CFF A and an (n, (s — l,r))-CFF B. 

2) Ask membership queries for all a A b, a £ A and b £ B. 

3) For every b G B. 

4) T b e- 1. 

5) For every i £ [n]. 

6) If for all a £ A, (a A 6)i > f(a A 6) 

7) then T b T b A a;;. 

8) T<-ru{T t }. 

9) Remove from T the term A ig [„]a;i 

and all subterm of a larger term. 


Fig. 1 . An algorithm for learning Qf r . 


Theorem 1. There is a non-adaptive proper learning algorithm for sterm r- 
MDNF that asks 

N(s, r) 1+o(1 ) log 2 n 

queries and runs in time poly(n,N(s,r)). 

Proof. Constructing a ( n , (l,r))-CFF of size |A| = r 2 log n and a (n, (s — l,r))- 
CFF of size \B\ = N(s—1, r) 1+ °( 1 ) logn = N(s, r) 1+o( - 1 ' ) logn takes poly(n, N(s, r)) 
time 1111101121231 . By Lemma 02 the learning takes time \A A B\ • n = poly 
(n,N(s,r)) time. The number of queries of the algorithm is |AAU| < |A| • \B\ = 
N(s, r) 1+0 ( is> log 2 n. □ 

4 The Second Algorithm 

In this section we give the second algorithm 
We first prove the following result 

Lemma 4. Let C be a class of boolean function that is closed under variable pro¬ 
jection. Let H be a class of boolean functions and suppose there is an algorithm 
that finds the relevant variables of f G H in time R(n). 

If C is non-adaptively learnable from H in time T(n) with Q(n ) membership 
queries then C is non-adaptively learnable from H in time 


O qd~n\ogn 


d 2 log n 
log (q/d 2 ) 


(T(q)n + R(q)) 


with 


,,i # 0 ( 5 ) , 

°U(F#) log " 


membership queries where d is an upper bound on the number of relevant vari¬ 
ables in f £ C and q > 2 d 2 . 











Proof. Consider the algorithm in Figure[2] Let A(ri) be a non-adaptive algorithm 
that learns C from H in time T(n ) with Q(n) membership queries. Let / £ C n 
be the target function. Consider the (n,q,d+ 1)-PHF P that is constructed in 
Lemma [T] (Step 1 in the algorithm). Since C is closed under variable projection, 
for every h £ P the function fh := f{xh( 1 ), • • • ,Xh( n )) is i n C q . Since the mem¬ 
bership queries to fh can be simulated by membership queries to / there is a 
set of \P\ ■ Q{q) assignments from {0,1}” that can be generated from A{q) that 
non-adaptively learn fh for all h £ P (Step 2 in the algorithm). The algorithm 
A(q) learns f' h £ H that is equivalent to fh- 

Then the algorithm finds the relevant variables of each f' h £ H (Step 3 in the 
algorithm). Let 14 be the set of relevant variables of f h and let d max = max/, |14|. 
Suppose Xi r ,.. ., Xi d ,, d' < d are the relevant variables in the target function /. 
There is a map h! £ P such that h'(i{), ..., h!{id>) are distinct and therefore f' h , 
depends on d 1 variables. In particular, d' = d ma x (Step 4 in the algorithm). 

After finding d! = d max we have: Every h for which f h depends on d! 
variables necessarily satisfies h(i{),... ,h(id’) are distinct. Consider any other 
non-relevant variable Xj qL {x q,..., Xi ,}. Since P is (n, q,d + 1)-PHF, there is 
h" £ P such that h"(j), h" (*i),..., h"(id') are distinct. Then f h „ depends on 
Xh"(i i), • • • ,Xh"{i d ,) and not in a ’h"{j)- This way the non-relevant variables can 
be eliminated. This is Step 6 in the algorithm. Since the above is true for every 
non-relevant variable, after Step 6 in the algorithm, the set X contains only the 
relevant variables of /. Then in Steps 7 and 8, the target function / can be 
recovered from any f h that satisfies |P(/io)| = d!. □ 


Algorithm Reduction I 

A(n) is a non-adaptive learning algorithm for C from H. 

1) Construct an (n, q, d + 1)-PHF P. 

2) For each h £ P 

Run A{q) to learn f h := f{x hW ,.. .,x h(n) ). 

Let f' h £ H be the output of A(q). 

3) For each h £ P 

Vh ■£- the relevant variables in f' h 

4) dmax 4 max/, |I4|- 

5) A' £- {xi,x 2 , ■ ■ ■, x„}. 

6) For each h £ P 

If |V4| — dmax then X £- A\{x; | x h (q 0 14} 

7) Take any h 0 with |14 0 | = dmax 

8) Replace each relevant variable Xi in f' h by Xj £ X where ho(j) = i. 

9) Output the function resulted in step (8). 


Fig. 2. Algorithm Reduction. 


We now prove 
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Theorem 2. There is a non-adaptive proper learning algorithm for s-term r- 
MDNF that asks 

N(s,r) 1+o( ' 1 ' 1 logn 


queries and runs in time (nlogn) ■ poly(N(s,r)) time. 


Proof. We use Lemma H) C = H is the class of s-term r-MDNF. This class is 
closed under variable projection. Given / that is s-term r-MDNF, one can find 
all the relevant variables in R{n) = poly(s) time. The algorithm in the previous 
section runs in time T(n) = poly(n, N(s, r)) and asks Q(n) = N(s , r) 1-1-0 ^' log 2 n 
queries. The number of variables in the target is bounded by d = rs. Let q = 
3r 2 s 2 > 2d 2 . By Lemma [2 there is a non-adaptive algorithm that runs in time 


O ( qd 2 nlogn + 
and asks 


d 2 log n 


(T(q)n + R(q)) ) = (n log n)poly(N(r, s)) 


log (q/d 2 ) 

°(i^$) logn ) =iv(> ' r>1+ ° lIlloe ’ 


membership queries. 
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